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ABSTRACT 

Understanding and modeling the mechanisms by which di- 
rected social networks evolve axe active areas of research. 
One related emerging topic is to understand and predict 
the formation of reciprocal edges, which has many poten- 
tial applications such as directed social network modeling, 
friend recommendation, information propagation and tar- 
geted spamming. 

However, there have been few studies providing a system- 
atic understanding of reciprocity at scale. In this paper, we 
bridge this gap using a novel large-scale Googlc+ dataset^ 
crawled by ourselves as well as one publicly available social 
network dataset. First, we investigate the structure and evo- 
lution of reciprocal and unreciprocal edges. Specifically, we 
find that reciprocal edges mainly connect users with similar 
degrees while unreciprocal edges mainly link ordinary users 
(e.g., users with low degrees) and popular users (e.g., celebri- 
ties). However, the impacts of reciprocal edges linking or- 
dinary and popular users on the network structures become 
increasingly significant as the social networks evolve. Sec- 
ond, we observe that factors including user behaviors, node 
attributes and edge attributes all have significant impacts 
on the formation of reciprocal edges. Third, in contrast to 
previous studies that treat reciprocity prediction as cither a 
supervised or a semi-supervised learning problem, we iden- 
tify that reciprocity prediction is better modeled as an out- 
lier detection problem. Finally, we perform extensive eval- 
uations with the two datasets, and show that our proposal 
outperforms previous reciprocity prediction approaches. 

1. INTRODUCTION 

Online social networks (e.g., Facebook, Googlc+, Flickr) 
have become increasingly important platforms for interact- 
ing with people, processing information and diffusing social 

*This work was done when Xu was a visiting student in UC 

Berkeley. 

^Wc have made our Google+ dataset publicly available, 
http:/ /www. cs.berkeley.edu/~stevgong/gplus. html 
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influence. Understanding and modeling the mechanisms by 
which these networks evolve are therefore fundamental is- 
sues and active areas of research. 

One related emerging topic is to understand and predict 
the formation of reciprocal edges [5, 11], the growing inter- 
est of which is highlighted by its importance in applications 
such as directed social network modeling, friend recommen- 
dation, information propagation, network compression [6] 
and targeted spamming, etc. 

In spite of the importance of reciprocity in social net- 
works, there have been few studies on it with two recent 
exceptions. One is the work done by Cheng et al. [5]. They 
treated the reciprocity prediction as a supervised learning 
problem. With a network constructed from the directed 
©-messages sent between users on Twitter, they identified 
a set of predictive features such as common neighbors and 
two-step paths extracted from network structure. The other 
nice work was done by Hopcroft et al. [11], who constructed 
a semi-supervised learning model for reciprocity prediction. 
With a directed Twitter follower network, they identified 
more predictive features such as status homophily, interac- 
tions (e.g., retweeting and replying tweets) and structural 
balance [4]. 

However, treating reciprocity prediction as a supervised or 
a semi-supervised learning problem could incur a serious is- 
sue. Specifically, given a network snapshot, these approaches 
treat reciprocal edges as positive examples and sample some 
unreciprocal edges as negative examples to train the mod- 
els. Unfortunately, these sampled unreciprocal edges are 
also test examples in the next snapshot. Moreover, some 
of them might have turned to be positive (i.e., reciprocally 
linked back). As a result, the better the trained models are, 
the worse generalization performances they possibly achieve. 
In fact, in the reciprocity prediction setting, we can only 
observe positive examples. Ftirthermore, the structure and 
evolution of reciprocal and unreciprocal edges are largely un- 
explored. It's also unclear that how user behaviors (e.g., ten- 
dencies of reciprocally linking back), node attributes (e.g., 
school, employer and major derived from users' profiles) and 
edge attributes (e.g., edge age) influence the formation of re- 
ciprocal edges. 

In this paper, we first study the structure and evolution 
of reciprocal and unreciprocal edges in Google+ and Flickr 
social networks. Specifically, we find that reciprocal edges 
mainly link users with similar degrees while unreciprocal 
edges mainly connect ordinary users (i.e., users with low de- 
grees) and popular users (e.g., celebrities). However, the im- 
pacts of reciprocal edges linking ordinary and popular users 




Figure 1: Illustration of friend requests, friend acceptances, 
parasocial edges and reciprocal edges. 

on the network structures become increasingly significant as 
the social networks evolve. Second, we observe that user 
behaviors, node attributes and edge attributes all have sig- 
nificant influences on the formation of reciprocal edges. For 
example, sharing common schools triples the probability of 
reciprocally linking back to an unreciprocal edge. These 
measurement results inform us the designs of reciprocity 
prediction features. Third, in contrast to previous studies 
that treat reciprocity prediction as either a supervised or a 
semi-supervised learning problem, we model reciprocity pre- 
diction as an outlier detection problem. Finally, we perform 
extensive evaluations with the Google+ and Flickr datasets, 
and demonstrate that our proposal outperforms previous 
reciprocity prediction approaches. 

2. NOTATIONS AND DATASETS 

In this section, we begin with the introduction of a few 
notations. Then, we describe our novel Google+ dataset and 
the publicly available Flickr dataset. 

2.1 Notations 

We denote a directed social network as G = {V, E), where 
V and E are respectively the set of nodes and edges. We 
will elaborate how the nodes and edges are constructed when 
describing the datasets. Moreover, the snapshot of G at time 
t is denoted as G'*'. 

A directed edge (u, v) is called as a friend request if the 
reverse directed edge (v, it) does not exist yet or appears af- 
ter (u, v), otherwise it's called as a friend acceptance. More- 
over, following the terminology in social science [12], we also 
classify edges be to parasocial and reciprocal. Specifically, a 
directed edge (u, v) is called parasocial if the reverse directed 
edge (v,u) does not exist yet, otherwise it's called recipro- 
cal. According to the definitions, parasocial edges are friend 
requests that haven't been accepted yet. Figure 1 illustrates 
these concepts. For instance, in G'*' , (w, u), (w, v) and (w, u) 
are friend requests, {u,w) is a friend acceptance, {w,u) and 
(u, w) are reciprocal edges and {w, v) and («, u) are paraso- 
cial edges. 

We can undirect a directed social network G = {V, E) 
to two undirected versions, i.e., parasocial version Gp — 
{Vp,Ep) and reciprocal version Gr ~ {Vr,Er), which satisfy 
that Vp = Vr = V, Ep = {{u,v)\{u,v) e E or {v,u) € E} 
and Er — {{u,v)\{u,v) £ E and {v,u) £ E}. Intuitively, 
the reciprocal version consists of undirected reciprocal edges 
while the parasocial version includes both undirected recip- 
rocal and parasocial edges. 

For a node u, we denote its incoming neighbors as Pi (u) — 
{v\{v,u) G E} and mdegree as di{u) — \ri{u)\, outgoing 
neighbors as To{u) = {v\{u, v) £ E} and outdegree as doiu) — 



|ro(u)|, parasocial neighbors as Tp{u) — ri{u) UVo{u) and 
parasocial degree as dp{u) — |rp(u)|, and reciprocal neighbors 
as Tr{u) = Ti{u) n Foiu) and reciprocal degree as dr{u) = 
|r,.(n)|. 

Assume nodes also have binary attributes (e.g., Google 
Inc., Computer Science, etc.). For each binary attribute a, 
we denote its social neighbors Fa (a) as the set of nodes in 
V that have the attribute a, and social degree as ds{a) = 
|rs(a)|. Furthermore, we denote the set of attributes of a 
node u as Ta{u) and attribute degree as da(u) = |ra(w)|. 

2.2 Datasets 

Google+: Google-|- was launched with an invitation-only 
test phase on June 28, 2011, and opened to everyone 18 years 
of age or older on September 20, 2011. Google-|- provides 
each user with an incoming friend list (i.e., "have you in 
circles"), an outgoing friend list (i.e., "in your circles") and 
a profile page. We began to crawl daily snapshots of public 
Google-|- social network structures and user profiles shortly 
after it was launched; our dataset consists of 79 snapshots 
crawled from July 6 to October 11, 2011 (i.e., 98 days). The 
first snapshot was crawled by breadth-first search (without 
early stopping). On subsequent days, we expanded the social 
structure from the previous snapshot. For most snapshots, 
our crawl finished within one day as Google did not limit 
the crawling rate during that time. The 79 snapshots are 
denoted as G'"', G^^\ ■ ■ ■ , G^^''\ where superscripts are the 
normalized crawling dates of the snapshots. Note that 19 
snapshots are missing in our dataset. 

We take each user u in Google+ as a node, and connect 
it to her outgoing friends via outgoing edges and incoming 
friends via incoming edges. Apart from the social structure, 
nodes also have attributes derived from users' profiles. We 
adopt four attribute types, i.e.. School, Major, Employer 
and City. Specifically, we find all distinct schools, majors, 
employers and cities that appear in at least one crawled user 
profile and use them as binary attributes. Around 22% of 
users have nonempty profiles and thus have observed node 
attributes in the last snapshot. 

Gong et al. [10] roughly divided the evolution of Google-|- 
into three phases: Phase I from day 1 to day 20, which cor- 
responds to the early days of Googie-|- whose size increased 
dramatically; Phase II from day 21 to day 75, during which 
Google-|- went into a stabilized increasing phase; and Phase 
III from day 76 to day 98, when Google-f was opened to 
public (i.e., without requiring an invitation), resulting in a 
dramatic growth again. We point out the three phases be- 
cause we will show in Section 3 that reciprocal edges also 
evolve differently in them. 

Flickr: Flickr is a photo-sharing site based on a social net- 
work, and it provides each user a friend list. This Flickr 
dataset, obtained from Mislove et al. [18], consists of two 
parts. One has 32 snapshots crawled daily between Novem- 
ber 2nd, 2006 and December 3rd, 2006, and the other has 
102 snapshots crawled daily between February 3rd, 2007 and 
May 18th, 2007. Since there is a two-month gap between the 
two parts, we adopt the second one in our study to avoid 
the influence of the missing data. Moreover, we denote these 
snapshots as G^°\G^^\- ■ ■ , G'^°^'. We take each user as a 
node and connect it to her friends via outgoing edges. Note 
that this Flickr dataset doesn't have node attributes infor- 
mation. 



Dataset summary: Table 1 shows the statistics of a 
few basic network metrics of the largest snapshots of the 
Google+ and Flickr datasets. Reciprocity in the table is the 
fraction of friend requests that are already accepted. 

3. STRUCTURE AND EVOLUTION 

In this section, we study both global and local structures 
and evolutions of reciprocal and parasocial edges. Specif- 
ically, we explore the global one via studying degree ho- 
mophily of the parasocial and reciprocal versions of the di- 
rected online social networks; we probe the local one via 
looking into the corresponding clustering coeffients. 

3.1 Global structure and evolution 

To this end, we look into degree homophily of the paraso- 
cial and reciprocal versions of the directed social networks. 
Degree homophily characterizes if two linked nodes tend 
to have similar degrees. Furthermore, assortativity coeffi- 
cient [21] ranging from -1 to 1 is used to quantify degree ho- 
mophily. Intuitively, we can roughly classify users in online 
social networks into two groups, i.e., ordinary users (e.g., 
low-degree users) and popular users (e.g., celebrities). Pos- 
itive assortativity indicates that edges tend to link users 
within the same group; negative assortativity represents that 
edges prefer to connect users across the two groups; neutral 
assortativity means edges link two users without considering 
which groups they belong to. 

In directed social networks, each node has 4 types of de- 
grees, i.e., outdegree, indegree, parasocial degree and recip- 
rocal degree. So there are 16 types of degree homophily 
(e.g., outdegree-indegree homophily, indegree-indegree ho- 
mophily). Previous work [19, 10] explored the outdegree- 
indegree homophily and found that Flickr has positive as- 
sortativity and Google+ has neutral assortativity. How- 
ever, the outdegree-indegree homophily doesn't inform us 
the structure of reciprocal nor parasocial edges. Differently, 
we study the parasocial-parasocial and reciprocal-reciprocal 
degree homophily. 

Table 1 shows the assortativity coefficients of parasocial 
and reciprocal versions of the two online social networks. We 
observe that they both have qualitatively different paraso- 
cial and reciprocal assortativities, i.e., parasocial assortativ- 
ity is neutral while reciprocal assortativity is high positive. 
This phenomena implies that reciprocal edges are mainly 
intra-group ones (i.e., linking users within the same group) 
while parasocial edges are more likely to be inter-group ones 
(i.e., linking users across the two groups). The dominating 
intra-group reciprocal edges make the reciprocal assortativ- 
ity high positive. Recall that the parasocial version consists 
of both the undirected parasocial and reciprocal edges, thus 
the dominating inter-group parasocial edges neutralize the 
dominating intra-group reciprocal edges, resulting in a neu- 
tral parasocial assortativity. 

Figure 2 further illustrates the evolution of the assorta- 
tivity coefficients in Google-l-^ and Flickr. Again, we ob- 
serve that parasocial and reciprocal assortativity coefficients 
evolve qualitatively differently. Note that reciprocally link- 
ing back to a parasocial edge only influences the recipro- 



^The X-axis of the Google-|- evolution figure spans over 
around 100 days although this Google-|- dataset only has 
79 daily snapshots because we use the actual crawling date 
of each snapshot. 
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Figure 2: Evolution of assortativity coefficients in (a) 
Google-f and (b) Flickr. 
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Figure 3: Evolution of average clustering coefficients in (a) 
Google-I- and (b) Flickr. 

cal assortativity while parasocial edges only influence the 
parasocial assortativity. 

Recall that the evolution of Google-|- is divided into three 
phases which is described in Section 2.2. Parasocial assor- 
tativity keeps stable in Phase I and Phase II and slightly 
decreases in Phase III. However, reciprocal assortativity in- 
creases dramatically in Phase I and decreases in Phase II and 
Phase HI. This implies that new inter-group and intra-group 
parasocial edges neutralize each other in Phase I and Phase 
II; after opening to the public (i.e., in Phase HI), new inter- 
group parasocial edges slightly dominate the intra-group 
ones. Furthermore, new intra-group reciprocal edges signifl- 
cantly dominate inter-group ones in Phase I; new inter-group 
reciprocal edges slightly dominate the intra-group ones in 
Phase II and Phase HI. 

In Flickr, parasocial assortativity slightly increases while 
reciprocal assortativity slightly decreases. Different from 
Google-f, this implies that new intra-group parasocial edges 
dominate new inter-group ones in Flickr. Similar to Google-|-, 
the decreasing reciprocal assortativity could imply that new 
inter-group reciprocal edges dominate intra-group ones. 

Both online social networks imply that the impacts of 
inter-group reciprocal edges on the network structures be- 
come more and more significant as time evolves. 

3.2 Local structure and evolution 

We study the local structure and evolution of reciprocal 
and parasocial edges via looking into the clustering coeffi- 
cients, which characterize how the neighbors of a node are 
connected. Given an undirected social network G — {V, E) 
and a node u, u's clustering coefficient is defined as c{u) = 
r(n)K|r(ij)|-i) ' where L{u) is the number of edges among u's 
neighbors r(u). The average clustering coejficient is defined 



Table 1: Statistics of a few basic network metrics of the largest snapshots of Google+ and Flickr. 





#Nodes 


#Edges 


Reciprocity 


Assort, coeff. 


Aver, clust. coeff. 


Par asocial 


Reciprocal 


Par asocial 


Reciprocal 


Go ogle + 


29,627,807 


473,888,579 


0.376 


-0.014 


0.301 


0.298 


0.296 


Fhckr 


2,302,925 


33,140,018 


0.451 


-0.003 


0.127 


0.361 


0.297 



as C = 



\V\ ^uev '^i''^) [25]. Intuitively, this is the average 
probability that a random pair of neighbors of a random 
node is connected. 

Previous work [21, 19, 10] found directed social networks 
have high clustering coefficients. However, these studies can- 
not demonstrate the local structure of reciprocal and paraso- 
cial edges. To this end, we study clustering coefficients of 
the parasocial and reciprocal versions of social networks. 

To determine the clustering coefficient of u, we need its 
degree and the number of edges among its neighbors. On 
one hand, parasocial edges increase it's parasocial degree. 
On the other hand, parasocial edges also increase the num- 
ber of edges among m's parasocial neighbors. So one natural 
question is which one plays a more important role in deter- 
mining the clustering coefficient. 

Table 1 shows the average clustering coefficients of the 
parasocial and reciprocal versions of the two social networks. 
We find that parasocial clustering coefficient is larger than 
the reciprocal one in both networks. Our observation indi- 
cates that parasocial edges, although making nodes' paraso- 
cial neighbors more than their reciprocal neighbors, connect 
the parasocial neighbors more tightly. Note that Cheng et 
al. [5] observed that the reciprocal clustering coefficient is 
much higher than the parasocial one in a Twitter interaction 
network, where nodes are Twitter users and a directed edge 
(m, v) means u has sent some ©-messages to v. This implies 
that friendship networks which are our cases are different 
from the interaction network in terms of the local structure 
of reciprocal and parasocial edges. 

Figure 3 illustrates evolutions of the clustering coefficients 
in Google-f and Flickr. We observe that parasocial and re- 
ciprocal clustering coefficients evolve in similar patterns. In 
Google-f-, both of them decrease dramatically in Phase I and 
Phase III and increase stably in Phase II. This implies that 
both of users' parasocial and reciprocal neighbors become 
more and more loosely connected in Phase I and Phase III 
while turning to be increasingly tightly connected in Phase 
II. In Flickr, both parasocial and reciprocal clustering co- 
efficients decrease over time, which indicates that users in 
Flickr have increasing number of neighbors and these neigh- 
bors are more and more loosely connected. 

3.3 Summary and implications 

In summary, we find that reciprocal edges mainly connect 
users with similar degrees while parasocial edges mainly link 
ordinary users and popular ones. However, the impacts of 
reciprocal edges linking ordinary and popular users on the 
network structures become increasingly significant as the so- 
cial networks evolve. Moreover, parasocial edges, although 
making nodes' parasocial neighbors more than their recipro- 
cal neighbors, connect the parasocial neighbors more tightly. 

Our findings have significant implications for directed so- 
cial network modeling. Existing directed social network 
models only try to match directed network metrics, thus 
possibly cannot match network metrics of the corresponding 
parasocial and reciprocal versions. For instance, we observe 
that the parasocial and reciprocal versions of directed social 
networks behave qualitatively differently with respect to as- 



sortativity coefficients. Unfortunately, we find that existing 
directed social network models (e.g., [14, 10]) cannot cap- 
ture such assortativity behavior. Our observations of the 
global and local structures of the reciprocal and parasocial 
edges could give us insights on the designs of better directed 
social network models. 

4. WHAT IMPACTS RECIPROCITY 

The previous section explores the structure and evolution 
of reciprocal and parasocial edges. These studies concern 
about the quantities of reciprocal and parasocial edges. Or- 
thogonally, we study what factors impact the probability of 
linking back to a parasocial edge in this section. Hopcroft et 
al. [11] found that such probability is infiuenced by the net- 
work structure, users' social status and interactions (e.g., 
retweeting and replying tweets in Twitter) between users, 
etc. In the following, we study the impacts of user behav- 
iors, node attributes and edge attributes on the linking-back 
probability. These studies give us insights on how to extract 
features when predicting reciprocity. 

4.1 User behavior 

Intuitively, users in online social networks could behave 
very differently in terms of issuing and accepting friend re- 
quests. For instance, one speculation is that users having 
higher tendencies to accept friend requests in the past are 
also more likely to accept new friend requests in the fu- 
ture; users whose friend requests were more likely to be 
accepted in the past will have higher probabilities of their 
friend requests being accepted in the future. To quantify 
these intuitions, we define respectively request and accep- 
tance local reciprocity as Rr{u) = dr{u) / do{u) and Ra{u) = 
driu) / diiu). Please refer to Section 2 for the defirutions of 
do^di and dr- Rr{u) characterizes the likelihood that m's 
friend requests are accepted; Ra{u) characterizes the prob- 
ability with which u accepts a friend request. 

To demonstrate the impact of local reciprocities on the 
formation of reciprocal edges, we calculate the probability 
of linking back to a parasocial edge as a function of local 
reciprocities. Specifically, we first discretize the local reci- 
procity ranging from to 1 to 20 bins. For each bin b, we 
collect all parasocial edges in G'^"^ whose tail points have 
Rr{u) in b or head points have Ra{u) in b, and compute 
the corresponding linking-back probability as the fraction 
of them that turn to be reciprocal in the last snapshot of 
Google-|- or Flickr. Figure 4a shows the linking-back prob- 
ability as a function of acceptance local reciprocity. We 
conclude that the linking-back probability increases as ac- 
ceptance local reciprocity increases. For instance, in Flickr, 
users that always reciprocally linked back to friend requests 
(i.e., users with acceptance local reciprocity 1) in the past 
also always do so in the future. The results of request lo- 
cal reciprocity are similar, but are not shown here due to 
the limited space. These findings support our speculations 
and the predictiveness of the local reciprocity for reciprocity 
prediction. 

4.2 Node attributes 
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Figure 4: The impacts of (a) acceptance local reciprocity, (b) node attributes and (c) edge age on the formation of reciprocal 
edges, (b) was obtained with only the Google+ dataset since the Flickr dataset doesn't have node attributes. 



We have node attributes in only Google+ dataset, on 
which we will focus in this section. Note that 78% of the 
Google+ nodes have no available attributes. In order to 
avoid the influences of these missing attributes, we use nodes 
with at least one attribute and edges between them. 

Recall that each node in the Google+ dataset could have 
four attribute types (i.e.. School, Major, Employer and City). 
Figure 4b shows their impacts on the formation of recipro- 
cal edges. The figure was computed as follows: we find all 
parasocial edges whose end points have at least 1 attribute 
and share or at least 1 attribute of some attribute type in 
G^^°\ then the corresponding linking-back probability is the 
fraction of such edges that are reciprocal in the last snap- 
shot. We observe that the four attribute types have different 
impacts on the formation of reciprocal edges. For instance, 
sharing the attribute type School triples the linking-back 
probability while sharing City just increases the probability 
by one third. On one hand, these findings indicate the node 
attributes are useful for predicting reciprocity. On the other 
hand, they inform us to consider the four attribute types 
seperately. Note that Hopcroft et al. [11] found that shar- 
ing the same time zone doesn't increase the linking-back 
probability in Twitter. However, we observe that sharing 
cities which are more fine-grained location information does 
increase the linking-back probability. 

4.3 Edge attributes 

In general, edge attributes could be messages sent from 
u to V, comments/likes u writes to v's posts/pictures and 
replying tweets, etc. For instance, Hopcroft et al. [11] found 
that retweeting or replying users' tweets increases the prob- 
ability of linking back to a parasocial edge in Twitter. 

Here, in the Google-I- and Flickr datasets, we treat the 
age of an edge {u, v) as its edge attribute. Similar to the 
measurement studies on local reciprocity and common at- 
tributes, we ensemble all parasocial edges in G'^"' whose 
ages are a, then we compute the linking-back probability 
with respect to a as the fraction of these edges that become 
reciprocal in the last Google-|- or Flickr snapshot. Since the 
time resolution of both datasets is a day, we compute the 
edge ages with respect to days. For instance, edges with age 
are the new edges in gure 4c shows the linking- 

back probability decreases dramatically as the edge age in- 
creases from to 3 and decreases relatively slowly when 
the edge age ranges from 4 to 20. For instance, linking- 
back probability of parasocial edges with age is around 
100 times higher than that of the parasocial edges with age 



20 in Flickr. These results imply that edge age is useful for 
predicting reciprocity. 

5. PREDICTING RECIPROCITY 

In this section, we study the prediction of reciprocal edges. 
First, we formally define the reciprocity prediction problem. 
Second, we discuss the extraction of features. Each of the 
features is supported by either previous work or our mea- 
surement studies in Section 4. Third, we map the reciprocity 
prediction to an outlier detection problem. 

5.1 Reciprocity prediction definition 

We denote the social network snapshot at time t as G'*'. 
Furthermore, let yuv = if the directed edge {u,v) is paraso- 
cial at time t, otherwise t/S = 1. The following statement 
formally defines the reciprocity prediction problem. 

Definition 1. Given a set of snapshots G^^\G^^\- ■■ ,G^*\ 
reciprocity prediction is to predcit yuv'^^ G {0, 1} for all 
parasocial edges at time t. 

5.2 Features 

We extract two categories of features for each directed 
edge. The two categories are single-node features and node- 
pair features. Table 2 summarizes these features. Our new 
features are indicated by the star marker * in the table. In 
the following, we will elaborate them one by one. 

5.2.1 Single-node features 

These features are extracted for each node individually. 
For an edge {u,v), the single-node features of both u and v 
are concatenated. 

Degrees [5]: Cheng et al. has shown that indegree and 
outdegree and their ratio are useful features for reciprocity 
prediction. Moreover, Hopcroft et al. [11] also found that 
high-degree users link back to high-degree users with a higher 
probability. Following Cheng et al., we extract di{u), do{u), 
do{u) / di{u) for each node u as features. 

Local reciprocity: We have shown in Section 4.1 that 
local reciprocities impact the formation of reciprocal edges 
significantly. So we extract both acceptance and request 
local reciprocities as features. 

5.2.2 Node-pair features 

We extract three categories of node-pair features for each 
edge (u, v). They are structural features, node-attribute and 
edge-attribute features. 



Table 2: Summary of the features. Our new features are indicated by the star marker *. 



Categories 


Names | Notations | Definitions 


Single-node 


Degrees 


Indegree 


di{u) 




Outdegree 


do(u) 


|ro(«)| 


Outdcgrcc/Indegree ratio 


do{u)/di{u) 


|ro(«)|/|r,(«)| 


Local reciprocities* 


Acceptance reciprocity 


Ra{u) 


dr{u)/di{u) 


Request reciprocity 


Rr{u) 


dr{u)/do{u) 


Node-pair 


Structures 


Common neighbors (CN) 


CNxy{u,v) 


\rx{u)nry[v)\, x,y£ {i,o,p,r}. 


Jaccards coefficients (JC) 


JCxy(u,v) 


\rx{u)nry{v)\/\r.4u)ury{v)\,x,ye{i,o,p,r} 


Adamic-Adar (AA) 


AAxyz{u, v) 


Sioerx(u)ni'„(i;) iog(\r^(w)\)' -^'V'" C [i',o,p,r] 


Preferential attachment (PA) 


do{u) ■ di(v) 
di{u) ■ do{v) 




JNode att 


ributes* 


Common attri. neighbors (CN-A) 


CN - Aa{u,v) 


|ra(M) n ra(D)| for four attri. types 


Attri. Jaccards coefficients (JC-A) 


JC - Aa{u, v) 


|ra(«) n ra{v)\/\ra{u) U ra{v)\ for four attri. types 


Attri. Adamic-Adar (AA-A) 


AA - Aa(u, v) 




Edge attributes* 
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Structural features: We extract a few classical link pre- 
diction features [16, 9] such as common neighbors, jaccard 
coefHcients, Adamic-Adar scores and preferential attachment 
as structural features. 

• Common neighbors (CN) In directed social networks, 
there are four types of neighbors for a node, i.e., incom- 
ing, outgoing, parasocial and reciprocal neighbors. Thus, 
two nodes u and v could have 4x4= 16 types of com- 
mon neighbors. We denote by CNxy{u, v) these 16 types 
of common neighbors, where x,y € {i, o,p, r}. Note that 
CNoi{u,v) is equivalent to the number of two-step di- 
rected paths from u to v, which was shown to be useful 
for reciprocity prediction [5]. 

• Jaccard's coefRcients (JC) [22] Jaccard's coefficient 
is a commonly used similarity metric in information re- 
trieval. The intuition behind the Jaccard's coefficient 
is to penalize the common social neighbors by the total 
number of social neighbors the two users have. Formally, 
■^^^^f = Ir^Wur^wl ' ■«^liere x,y € {i, o, p, r}. Since each 
node has 4 kinds of social neighbors, we have 16 types of 
Jaccard's coefficients. 

• Adamic-Adar (AA) [1] Intuitively, we want to down- 
weight the importance of neighbors that are social hubs. 
AA score quantifies this intuition as 

AAxy.iu,v)= J2 logi\Tz{w)\) 

, where x,y,z € {i,o,p,r}. Totally, we have 64 AA fear- 
tures. 

• Preferential attachment (PA) PA is empirically ob- 
served to be a basic mechanism that edge formation fol- 
lows in various networks [10, 20, 3, 13]. As was proposed 

by Cheng et al. [5], we calculate PA{u,v) = do{u) • di{v) 
and PA{v,u) — di{u) ■ do{v) as features. 

Note that Cheng et al. [5] only extracted CNu, CNoo, 
JCii, J Coo, AAiii and PA as structural features. Hopcroft 
et al. [11] used CNrr as features. 

Node-attribute features: We have node attributes in 
only Google-I- dataset, on which we will focus in this sec- 
tion. However, we want to stress that our ways of extract- 



ing node attributes features can be naturally generalized to 
other social networks. 

• Common attribute neighbors (CN-A) Recall that 
we have shown in Figure 4b that the four attribute types 
(i.e.. School, Major, Employer and City) have different 
impacts on the formation of reciprocal edges. So we con- 
sider them seperately. Specifically, we extract the num- 
ber of common attribute neighbors for each attribute 
type as features. 

• Attribute Jaccard's coefHcients (JC-A)/Adamic- 
Adeu- (AA-A) Similar to structural JC and AA fear- 
tures, we downweight the importance of a common at- 
tribute with either the total number of attributes the two 
users have or the number of social neighbors of the at- 
tribute. Formally, attribute JC is defined as JC — Aa{u, v) 
= [r°(")ur°(")[ ; attribute AA is defined as AA — A„(w, v) 

= E6er„(„)nr„(„) ioa(\r,{b)\) ■ ^-gain, we extract these 
features for the four attribute types seperately. 

Edge-attribute features: We have shown in Figure 4c 

that edge age impacts the formation of reciprocal edges sig- 
nificantly. So we extract a feature from the edge age. For a 
parasocial edge, this feature is simply its age. However, it's 
trickier to extract this feature for a reciprocal edge. Note 
that most of the reciprocal edges could have large ages since 
they might appear a long time ago, making them indistin- 
guishable with large-age parasocial edges. However, a re- 
ciprocal edge might have a small age at the time when it 
became reciprocal. So we extract this age as the feature. 

5.3 Modeling reciprocity prediction as outlier 
detection 

Previous studies treat reciprocity prediction as cither a 
supervised [5] or a semi-supervised learning problem [11]. 
With snapshot G'*^ , their approaches treat reciprocal edges 
as positive examples and sample some parasocial edges as 
negative examples when training the models. However, the 
parasocial edges in G'*-* are also test examples in G''^^' and 
some of the sampled ones might have become positive in 
G('+i\ Fi gure 1 demonstrates such an issue. In G('), edge 
{w,u) is a positive example, and edge {v,u) is sampled as 
a negative example. However, {v,u) turns to be positive 



in the test snapshot G^'^^^K As a result, the better their 
trained models are, the worse generahzation performances 
they possibly achieve. 

Actually, in the setting of reciprocity prediction, we can 
only observe positive examples (i.e., reciprocal edges). So 
we propose to model reciprocity prediction as an outlier de- 
tection problem with known positive examples. We'll use 
one-class Support Vector Machine [17] as an outlier detec- 
tor. 

6. EVALUATIONS 

In this section, we show our experimental results of reci- 
procity prediction. First, we introduce our experimental 
setup. Then we compare our proposal with previous ap- 
proaches. 

6.1 Experimental setup 

In the following, we will cover the construction of the 
training and tost datasets, approaches we compare our pro- 
posal with, data normalization techniques we apply to the 
feature matrix, and metrics adopted. 

Training and test: According to the definition, reciprocal 
edges exist as pairs of edges. For instance, both (w, u) and 
(w, w) in Figure 1 are reciprocal edges. However, to make 
our description more clear, for a pair of reciprocal edges, we 
only call the one that appeared earlier as a reciprocal edge 
in the entire Section 6. To continue the above example, 
only {w, u) is called reciprocal since it appeared earlier than 
(u.w). Intuitively, these newly defined reciprocal edges are 
friend requests that are already accepted. However, we keep 
the original definition of parasocial edges. 

Around 78% of users have no available node attributes in 
the Google-|- dataset. Thus, in order to avoid the influences 
of the missing attributes, we further preprocess Google+ via 
only keeping nodes with at least one attribute and edges be- 
tween them. However, we should point out that the features 
of an edge are still extracted from the original Google-f-. 

We use G'^"' of Flickr and the preprocessed Google-f as 
the training and validation data. Note that the parasocial 
edges in G'-'^"-' are also the test dataset. We choose G'-'^"' to 
construct the training and test data because we can extract 
edge age features for the edges in it and the parasocial edges 
(i.e., test data) have enough time to be reciprocal in the last 
snapshots if they would be. 

In the training phase, we treat the reciprocal edges in 
as positive training examples. These positive exam- 
ples are enough to train the outlier detection model. How- 
ever, supervised and semi-supervised learning models also 
require negative examples. So, as was done in [5, 11], we 
sample aP parasocial edges in G^^"' and treat them as neg- 
ative examples, where P is the number of positive training 
examples. Note that some of the sampled training negative 
examples are also test positive examples, possibly resulting 
in bad generalization performances. So we design two sam- 
pling strategies, i.e., random and edge-age sampling. Ran- 
dom sampling means we sample the aP negative examples 
uniformly at random. Recall that Figure 4c shows parasocial 
edges with larger edge ages are less likely to be reciprocal 
in the future. So, in order to reduce the number of sam- 
pled training negative examples that are also test positive 
examples, we design edge-age sampling, which samples the 
aP negative examples with the largest edge ages. We use 



Table 3: Statistics of the training and test datasets. 





Training 


Test 


Reciprocal 


Parasocial 


Reciprocal 


Parasocial 


Google-f- 


11,993,458 


16,250,289 


1,186,903 


15,063,386 


Flickr 


8,211,426 


9,906,869 


139,762 


9,767,107 



2-fold cross validation and grid search to learn the model 
hyperparamcters. Since this proccsure is time-consuming, 
we perform it with 10% of the training examples sampled 
uniformly at random. Then we retrain the models on the 
full training data with the learned hyperparamcters. In the 
test phase, a parasocial edge in G*-^"^ is a test positive ex- 
ample if it's reciprocal in the last snapshot of Google-I- or 
Flickr, otherwise it's a test negative example. 

Table 3 shows the statistics of the training and test datasets. 
7.3% and 1.4% of the training parasocial edges turn to be 
tost reciprocal ones in Googlc+ and Flickr respectively. These 
edges make the supervised and semi-supervised learning ap- 
proaches achieve bad test performances. We note that 9.9% 
of the parasocial edges in G'-^'^' turn to be test reciprocal 
ones in the original Google-|- dataset. 

Comparisons: We compare the following approaches. 

• SVM-R Binary SVM [7] with the random sampling strat- 
egy to sample the negative examples. 

• SVM-A Binary SVM with the edge-age sampling strat- 
egy- 

• TriFG-R TriFG [11] is a semi-supervised learning frame- 
work based on a factor graphical model. Apart from all 
the features discussed in Section 5.2, TriFG also incorpo- 
rates structural balance [4] by modeling it as a factor in 
the factor graphical model. So we also extract structural 
balance features as was done in Hopcroft [11] when test- 
ing TriFG. TriFG-R samples the negative examples with 
the random sampling strategy. 

• TriFG-A TriFG with the edge-age sampling strategy. 

• OC-SVM One-class Support Vector Machine [17] is an 
outlier detector with known positive examples. Note that 
we don't need to sample negative examples for OC-SVM. 

We only consider linear kernels for SVM and OC-SVM 
due to scalability issues. We use LIBSVM^ for OC-SVM 
implementation and LIBLINEAR'' for SVM implement. LI- 
BLINEAR's linear SVM implementation is much faster than 
LIBSVM's. However, LIBLINEAR doesn't have OC-SVM 
implementation for now. TriFG implementation was ob- 
tained from the authors [11]. All these algorithms wore run 
on a machine with 500GB main memory and 32 cores. 

Data normalization: We ensemble all the feature vectors 

into a feature matrix, rows of which correspond to edges and 
columns of which correspond to features. It's well known 
that the performances of many machine learning algorithms 
are sensitive to data normalizations. So we apply three nor- 
malization techniques to this feature matrix. They are i) 
column normalization, which normalizes each column of the 
feature matrix to have mean and variance 1, ii) row nor- 
maliztion, which normalizes each row to have L2 norm 1, 
and iii) column-row normaliztion, which sequentially applies 
column and row normalizations to the feature matrix. 
We find that algorithms tested in the following perform 

^http://www.csie. ntu.edu. tw/~cjlin/libsvm/ 
''http://www.csie.ntu.edu.tw/~cjlin/liblinear/ 



0.9 
0.8 
0.7 

(£o.c 

0.5 
0.4 
0.3 















4-^ SVM-R 
■fc* SVM-A 

TriFG-R 
¥^ TriFG-A 
•— • OC-SVM 



0.01 0.1 0.5 1.0 

Sampling ratio q 

(a) Precision in GoogleH 




0.01 0.1 0.5 1.0 
Sampling ratio a 

(b) Recall in Google+ 




0.01 0.1 0.5 1.0 

Sampling ratio q 

(c) Fl in Google+ 




0.01 0.1 0.5 1.0 

Sampling ratio a 

(d) Precision in Flickr 




'ifa 



0.01 0.1 0.5 1.0 
Sampling ratio q 

(e) Recall in Flickr 



0.,'i 
0.2 
0.1 



4^ SVM-R 

Hmk SVM-A 
^-^ TriFG-R 
■W^ TriFG-A 
•-• OC-SVM 











0.01 0.1 0.5 1.0 

Sampling ratio r\ 

(f) Fl in Flickr 



Figure 5: Comparisons between our proposal and previous approaches in the Google+ and Flickr social networks. 



the best with different normalization techniques. However, 
we will only show the results with the best normalization 
due to the limited space. 

Metric: The number of reciprocal and parasocial edges are 
highly imbalanced in the test phase. Thus accuracy is not 
an appropriate metric. For instance, a naive model which 
always outputs negative can already achieve test accuracy 
0.986 in Flickr and 0.927 in Google+. So, as was done in 
Hopcroft et al. [11], we adopt Precision, Recall and Fl as 
the metrics. Precision is the portion of predicted positive 
examples that are true reciprocal edges. Recall is the portion 
of true reciprocal edges that are predicted as positive. Fl 
score is the harmonic mean of Precision and Recall. 

6.2 Comparison results 

Figure 5 shows the test performances of the approaches 
as functions of the sampling ratio a. a > 1.5 corresponds 
to the scenario in which all parasocial edges are sampled as 
negative examples. We have the following observations. 
OC-SVM v.s. SVM and TriFG: OC-SVM achieves 
better Fl scores than SVM and TriFG approaches in both 
Google-1- and Flickr datasets. On one hand, SVM and TriFG 
approaches sample some parasocial edges as negative ex- 
amples in the training phase. However, around 7.3% and 
1.4% of them are also test positive examples in Google-|- and 
Flickr respectively, which cause bad test performances. On 
the other hand, OC-SVM doesn't use the parasocial edges 
in the training phase, thus avoids this issue. 

Random sampling v.s. edge-age sampling: Edge-age 
sampling achieves better Recall than random sampling for 
both SVM and TriFG approaches. We take SVM with the 
sampling ratio a = 1.0 and Google-|- as an example to il- 
lustrate the reason. Specifically, 875,989 sampled negative 
examples are actually test positive examples with random 
sampling and 580,726 of them are classified as negative in 
the training and test phases. However, the number of such 
examples decreases to 567,620 with edge-age sampling and 



440,346 of them are classified as negative. The behavior of 
Precision is more complicated. Specifically, edge-age sam- 
pling slightly helps TriFG but makes SVM perform worse. 
The reason might be that edge-age sampling samples a bi- 
ased subset of the parasocial edges, which decreases SVM- 
A's Precision. 

Impacts of the sampling ratio: For SVM approaches. 
Precision increases and Recall decreases as the sampling ra- 
tio Q increases. On one hand, a larger a treats more paraso- 
cial edges as training negative examples and thus classifies 
more test reciprocal edges as negative, which explains the 
decreasing Recall. On the other hand, a larger a also cor- 
rectly classifies more test parasocial edges as negative and 
thus decreases the number of test parasocial edges among 
the predicted reciprocal edges, which increases the Precision. 
However, the behavior of the Fl score depends on datasets. 
Specifically, SVM's performances increase in Google-|- but 
decrease in Flickr when a goes to 0. Interestingly, TriFG 
approaches are relatively robust to the selection of a with 
respect to all the three metrics. We speculate the reason is 
that TriFG incorporates structural balance information. 

7. RELATED WORK 

Measuring reciprocity: Garlaschelli and Loffredo [8] 
measured reciprocity in various networks (e.g., WWW, email 
networks, etc.). Akoglu et al. [2] quantified reciprocity in 
weighted communication networks (e.g., phone call network). 
Cheng et al. [5] found that reciprocal version has a higher 
average clustering coefficient than parasocial version in their 
Twitter interaction network, which implies that friendship 
networks (e.g., Google-|- and Flickr) and the interaction net- 
work have different local structures of reciprocal and paraso- 
cial edges. Hopcroft et al. [11] found that, in a Twitter fol- 
lower network, network structure, users' social status and 
interactions infiuence the formation of reciprocal edges. 

Predicting reciprocity: Cheng et al. [5] treated red- 



procity prediction as a supervised learning problem. Hopcroft 
at al. [11] constructed a semi-supervised learning framework 
for reciprocity prediction. These approaches require nega- 
tive training examples, which are sampled from parasocial 
edges. Unfortunately, these sampled negative examples are 
also test examples, and some of them will turn to be positive 
in the test phase, which possibly decreases the generalization 
performances. On the contrary, we identify the reciprocity 
prediction is better modeled as an outlier detection problem. 

Differences with other link mining tasks: Reciprocity 

prediction is related to a few other link mining tasks such 
as hnk prediction and link sign prediction, and link type pre- 
diction, but they are essentially different. 

The classical link prediction problem aims to identify links 
that are missing in the current social network snapshot but 
are possible to appear in the near future [16, 9]. Reciprocity 
prediction and link prediction differ in several important as- 
pects. First, previous work [5] has shown that features work- 
ing well for link prediction arc not the most effective ones for 
reciprocity prediction. Second, in the setting of reciprocity 
prediction, a parasocial edge {u, v) already exists between u 
and V, from which we can extract features (e.g., edge age). 

The link sign prediction [15] classifies social relationships 
to be either positive (e.g., friendship) or negative (e.g., op- 
position or antagonism). This forms interesting contrasts 
with reciprocity prediction in the sense that reciprocal and 
parasocial links can easily exhibit either type of sign. 

The link type prediction [23, 24] assigns relationship cat- 
egories (e.g., advisor-advisee, classmate) to a pair of users. 
This problem implicitly treats the relationships between peo- 
ple as symmetric, i.e., {u,v) and {v,u) should belong to the 
same category, which makes it fundamentally different from 
reciprocity prediction. 

8. CONCLUSION AND FUTURE WORK 

In this paper, we first study the structure and evolution 
of reciprocal and parasocial edges in Googlo+ and Flickr. 
We find that reciprocal edges mainly connect users with 
similar degrees while parasocial edges mainly link ordinary 
users (e.g., low-degree users) and popular users (e.g., celebri- 
ties). However, the impacts of reciprocal edges linking or- 
dinary and popular users on the network structures become 
increasingly significant as the social networks evolve. More- 
over, parasocial edges, although making nodes' parasocial 
neighbors more than their reciprocal neighbors, connect the 
parasocial neighbors more tightly. Second, we find that user 
behaviors, node and edge attributes all have significant in- 
fluences on the formation of reciprocal edges. Third, in con- 
trast to previous studies that treat reciprocity prediction as 
either a supervised or a semi-supervised learning problem, 
we identify that reciprocity prediction is better modeled as 
an outlier detection problem. Finally, we perform exten- 
sive evaluations with the Google-I- and Flickr datasets, and 
demonstrate that our proposal outperforms previous ones. 

One interesting future work could bo to design a directed 
network model that matches real networks with respect to 
not only directed network metrics but also undirected ones 
of the corresponding parasocial and reciprocal versions. It's 
also interesting to design an outlier detecter incorporating 
the structural balance features. 
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